Predicting Actions from Static Scenes

نویسندگان

  • Tuan-Hung Vu
  • Catherine Olsson
  • Ivan Laptev
  • Aude Oliva
  • Josef Sivic
چکیده

Human actions naturally co-occur with scenes. In this work we aim to discover action-scene correlation for a large number of scene categories and to use such correlation for action prediction. Towards this goal, we collect a new SUN Action dataset with manual annotations of typical human actions for 397 scenes. We next discover action-scene associations and demonstrate that scene categories can be well identified from their associated actions. Using discovered associations, we address a new task of predicting human actions for images of static scenes. We evaluate prediction of 23 and 38 action classes for images of indoor and outdoor scenes respectively and show promising results. We also propose a new application of geo-localized action prediction and demonstrate ability of our method to automatically answer queries such as “Where is a good place for a picnic?” or “Can I cycle along this path?”.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Im2Flow: Motion Hallucination from Static Images for Action Recognition

Existing methods to recognize actions in static images take the images at their face value, learning the appearances—objects, scenes, and body poses—that distinguish each action class. However, such models are deprived of the rich dynamic structure and motions that also define human activity. We propose an approach that hallucinates the unobserved future motion implied by a single snapshot to h...

متن کامل

Predicting the Percentage of Sway Index from the Static Balance Based on the Anthropometric Dimensions of Construction Workers

Introduction: The purpose of the current study was to predict the percentage of the sway index from the static balance point based on the anthropometric dimensions of construction workers. Material and Methods: This descriptive-analytical study was conducted on 114 construction workers. First, the construction workers were asked to complete the demographic questionnaire and the inclusion crite...

متن کامل

Corpus-Guided Sentence Generation of Natural Images

We propose a sentence generation strategy that describes images by predicting the most likely nouns, verbs, scenes and prepositions that make up the core sentence structure. The input are initial noisy estimates of the objects and scenes detected in the image using state of the art trained detectors. As predicting actions from still images directly is unreliable, we use a language model trained...

متن کامل

Dynamical optical flow of saliency maps for predicting visual attention

Saliency maps are used to understand human attention and visual fixation. However, while very well established for static images, there is no general agreement on how to compute a saliency map of dynamic scenes. In this paper we propose a mathematically rigorous approach to this problem, including static saliency maps of each video frame for the calculation of the optical flow. Taking into acco...

متن کامل

SUNDAy: Saliency Using Natural Statistics for Dynamic Analysis of Scenes

The notion that novelty attracts attention is core to many accounts of visual saliency. However, a consensus has not been reached on how to best define novelty. Various interpretations of novelty lead to different bottom-up saliency models that have been proposed for static images and more recently for dynamic scenes. In previous work, we assumed that a basic goal of the visual system is to loc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014